CLAIMS : 

What is claimed is: 




>< y£ - ^ method of ^omputing a model, comprising: 
/ comparing attribute values for samples having a desired 

/ attribute to attribute values for all samples; and 

selecting a subset of available attributes based on a 
difference between attribute values for the samples having 
the desired attribute ajid attribute values for all of the 
samples . 

2. The method ok claim 1, wherein the step of comparing 
attribute values for ^amples having a desired attribute to 
attribute values for(a^.l samples further comprises: 

determining a ^pVtistical measure of difference between 
the attribute values for samples having the desired 
attribute and the attribute values for all of the samples. 

3. The method of cslaim 2, wherein the step of determining 
a statistical measure of difference between the attribute 
values for samples having the desired attribute and the 
[attribute values for aM of the samples further comprises: 

determining an entropy for the attribute values. 

4. The method of claim ]\ wherein the step of selecting a 
subset of available attributes based on a difference between 
attribute values for the samples having the desired 
attribute and attribute valued for all of the samples 
further comprises: 

identifying n attributes having a largest difference in 
attribute values. 
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5. The method of claim 1, wherein the step of selecting a 
subset of available attributes based on a difference between 
attribute values for the samples having the desired 



attribute and attribute values for all of the samples 
further comprises 

identifying ^ predetermined percentage of attributes 
having a larger difference in the attribute values than 
remaining attribute^ 

6. The method of cl^aim 1, wherein the step of selecting a 
subset of available attributes based on a difference between 
attribute values for tne samples having the desired 
attribute and attribute yalues for all of the samples 
further comprises ; 

identifying attributes having a difference in the 
attribute values exceeding^ predetermined amount. 

7. The method of Vlaim 1, further comprising: 
obtaining a plurality of samples, each sample having 

values for a plurality/ \pf attributes. 

8. The method of claim 1, further comprising: 
employing the selected subset of attributes to generate 

a predictive model. 
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1 9. A method of computing a model, comprising: 

2 obtaining a plurality of samples each having values for 

3 a plurality of attributes; 

4 comparing attribute values for samples having at least 

5 one desired attribute to attribute values for all of the 

6 plurality of samples; 

7 selecting attributes having a largest difference 

8 between attribute valyes for samples having the at least one 

9 desired attribute and\attribute values for all of the 

10 plurality of samples; and 

11 computing a model employing the selected attributes. 

P io. The method of claim J&^Wherein the step of selecting 

|y attributes having a largest difference between attribute 

M values for samples having the at least one desired attribute 

|p and attribute values for alA of the plurality of samples 

SH- further comprises: \ 

§* identifying a predetermined number of attributes having 

C ! i the largest difference in attribute values. 

|J, 11. The method of claim 9, wherein the step of selecting 

2Q attributes having a largest difference between attribute 

T values for samples having the at reast one desired attribute 

4 and attribute values for all of thA plurality of samples 

5 further comprises: \ 

6 identifying a predetermined percentage of attributes 

7 having the relative difference in attribute values. 

1 12. The method of claim 9, wherein theVstep of selecting 

2 attributes having a largest difference between attribute 

3 values for samples having the at least one desired attribute 

4 and attribute values for all of the pluraMty of samples 
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further comprise^ 

identifyi^r ^attributes having a difference in attribute 
values equal to or greater than a predetermined amount . 
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13. A method^ of selecting attributes for computing a model, 
comprising : 

for a plurality of samples each having values for a 
plurality of attributes: 

for eac\i of the plurality of attributes: 

comparing the attribute values for a first 
group of samples to the attribute values for all 
of the plurality of samples; and 

determining a difference between the 
attribute values for the first groups and the 
attribute values for all of the plurality of 
samples; and 

identifying attributes within the plurality of 
attributes having a largest difference between the 
attribute values for the first groups and the attribute 
values for all of the plurality of samples; and 
selecting at least some of the identified attributes. 
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14. A system >£or selecting attributes for computing a 
model, comprising: 

a memory containing data for a plurality of samples 
each having values for a plurality of attributes; and 

a processor coupled to the memory and executing a 
selection process including: 

comparing\ attribute values for samples having a 

desired attribute to attribute values for all samples; 
selecting ay subset of available attributes based 

on a difference between attribute values for the 

samples having thA desired attribute and attribute 

values for all of bhe samples; and 

employing the se leAed subset of attributes to generate 
a predictive model. 

15. The system of claim 14, wherein the selection process 
determines a statistical measure of difference between the 
attribute values for samples having the desired attribute 
and the attribute values for all of/the samples. 

16. The system of claim 15, wherein the selection process 
determines an entropy for the /attribute values. 

17. The system of claim L4, wherein the selection process 
identifies a predetermined number of attributes having a 
largest difference in t^he attribute values for selection. 

18. The system of claim 14, wherein the selection process 
identifies a predetermined percentage of attributes having a 
larger difference /in the attribute values for selection. 



19. The system/of claim 14, wherein the selection process 
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identifies, for selection, attributes having a difference 
-i-hf* an-rih^ itr v n lnr n nrrrr i iing n-prriirtrrmrnrrj nmnnnt . 
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20. A system for computing a model, comprising: 

a memory containing data for a plurality of samples 
each having values fqr a plurality of attributes; and 

a processor coupled to the memory and executing a 
selection process including: 

comparing attribute values for a first subset of 
the plurality of samples to attribute values for all c 
the samples; \ 

selecting attributes having a largest difference 
between attribute values for the first subset and 
attribute values for al^of the samples; and 

computing a model employing the selected 
attributes. \ 
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21. A computer ^program product within a computer usable 
medium for selecting attributes for computing a model, 
comprising : \ 

instructions for reading values of attributes for a 
plurality of samples; 

instructions for comparing attribute values for sample 
having a desired attribute to attribute values for all 
samples; and \ 

instructions for selecting a subset of available 
attributes based on a difference between attribute values 
for samples having the desired attribute and attribute 
values for all samples. \ 

22. The computer program product of claim 21, wherein the 
instructions for comparing attribute values for samples 
having a desired attribute t© attribute values for all 
samples further comprise: \ 

instructions for determining a statistical measure of 
difference between the attribute values for samples having 
the desired attribute and the attribute values for all 
samples. \ 

23. The computer program product of claim 22, wherein the 
instructions for determining a statistical measure of 
difference between the attribute values for samples having 
the desired attribute and the attribute values for all 
samples further comprise: \ 

instructions for determining an entropy of the 
attribute values for samples having the\desired attribute 
and an entropy of the attribute values for all samples; 

instructions for comparing the entropy of the attribut 
values for samples having the desired attribute to the 
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entropy of tha attribute values for all samples for each 
attribute to determine a relative measure of difference; and 

instructions for comparing the relative measure of 
difference of al A attributes . 

24. The computer program product of claim 21, wherein the 
instructions for selecting a subset of available attributes 
based on a difference^ between attribute values for samples 
having the desired attribute and attribute values for all 
samples further compris^: 

instructions for identifying n attributes having a 
largest difference in the\attribute values. 
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25. A computer program product within a computer usable 
medium for selecting attributes for computing a model, 
comprising: 

instructions fdr comparing attribute values for a first 
group of samples to attribute values for all samples for 
each of a plurality of attributes; 

instructions for (determining a difference between the 
attribute values for tne first group of samples and the 
attribute values for alA of the samples; and 

instructions for selecting a group of attributes having 
a largest difference between the attribute values for the 
first group of samples and\ the attribute values for all 
samples . 
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